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Draft Genome Sequence of Sphingobium sp. Strain BHC-A, Revealing 
Genes for the Degradation of Hexachlorocyclohexane 
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Here, we report the draft genome sequence of Sphingobium sp. strain BHC-A, a lin gene-based hexachlorocyclohexane (HCH)- 
degrading strain, isolated from soil that suffered long-term HCH contamination in an insecticide factory. 
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Technical hexachlorocyclohexane (t-HCH) mainly consists of 
a-, j8-, y-, and S-isomers (1, 2). Only y-HCH has insecticidal 
activity. Both t-HCH and -y-HCH were used worldwide between 
the 1940s and the 1990s to control a wide range of agricultural, 
horticultural, and public health pests. Now, they are banned or 
restricted in most countries because of their toxicity and persis- 
tence in the environment. However, these insecticides are still be- 
ing manufactured and used as a cheap but effective insecticide in 
some developing countries, mainly for economic reasons (3). 
Therefore, residue problems of HCH will continue for a long time. 
Bioremediation has become a viable and promising biotechnolog- 
ical approach to cleaning up polluted environments. An aerobic 
bacterium, Sphingobium sp. strain BHC-A, was isolated from 
soil that suffered long-term HCH contamination in an insecti- 
cide factory (4). This strain degraded a-, j3-, y-, and 5-HCH. 
The linABCDEFR genes have been cloned, and they all show 
high identity to the counterparts of Sphingobium japonicum 
UT26, except linB. The LinB proteins of strain BHC-A have 
seven heterogeneous amino acid residues in sites similar to 
those of LinB from UT26, which determine the special role of 
LinB in the transformation of /3- and 8-HCH (5, 6). To further 
investigate the degradation pathways of j3- and S-HCH as well 
as other recalcitrant xenobiotics, we sequenced the genome of 
strain BHC-A. 

Genome DNA was extracted using the Qiagen Genomic-tip 
20/G kit, following the manufacturer's instructions. Paired-end 
genome sequencing was performed for a 350-bp DNA fragment 
on an lUumina Solexa sequencing platform at the Beijing Genome 
Institute, Shenzhen, China. In total, 632.64 Mb of raw data was 
obtained and 500 Mb of data passed quality trimming. The ge- 
nome was assembled by Velvet (v 1.2.10) (7) with a 31-basek-mer, 
resulting in 2 1 7 scaffolds larger than 500 bp. The maximum length 
of these scaffolds was 132,651 bp. The total length of the genome 
was estimated to be 4,063,496 bp, with a GC content of 65.3%. 
Genome annotation was performed using the RAST server with 
the SEED database (8) and the NCBI Prokaryotic Genomes 



Automatic Annotation Pipeline (http://www.ncbi.nlm.nih.gov 
/genomes/static/Pipeline.html) . 

The chromosome of the strain harbors 3,537 protein-coding 
sequences, 391 subsystems, 45 tRNAs, and 2 rRNAs. Fourteen lin 
genes, including linBCDEFGHIJKLMNR, were annotated and lo- 
cated in the draft genome. However, linA was not found, and we 
speculate that linA is in the gaps, because its length is only 471 bp 
and our previous study also proved the existence of linA (5). Data 
and information from this study will be used and a comparative 
analysis with other genomes of HCH-degrading sphingomonads 
(9-17) will be carried out to increase understanding of HCH deg- 
radation, especially the evolution and acquisition of lin genes in 
sphingomonads. 

Nucleotide sequence accession numbers. The draft genome 
sequence of Sphingobium sp. BHC-A has been deposited in Gen- 
Bank under the accession number JDRUOOOOOOOO. The version 
described in this paper is version JDRUOIOOOOOO. 
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